Tutorial on Hardware and Software Reliability, Maintainability, and Availability
نویسنده
چکیده
Description: Computer systems, whether hardware or software, are subject to failure. Precisely, what is a failure? It is defined as: The inability of a system or system component to perform a required function within specified limits. Afailure may be produced when a fault is encountered and a loss of the expected service to the user results [IEEE/AIAA P1633]. This brings us to the question of what is a fault? A fault is defect in the hardware or computer code that can be the cause of one or more failures. Software–based systems have become the dominant player in the computer systems world. Since it is imperative that computer systems operate reliably, considering the criticality of software, particularly in safety critical systems, the IEEE and AIAA commissioned the development of the Recommended Practice on Software Reliability. This tutorial serves as a companion document with the purpose of elaborating on key software reliability process practices in more detail than can be specified in the Recommended Practice. However, since other subjects like maintainability and availability are also covered, the tutorial can be used as a stand–alone document. While the focus of the Recommended Practice is software reliability, software and hardware do not operate in a vacuum. Therefore, both software and hardware are addressed in this tutorial in an integrated fashion. The narrative of the tutorial is augmented with illustrative solved problems. The recommended practice [IEEE P1633] is a composite of models and tools and describes the "what and how" of software reliability engineering. It is important for an organization to have a disciplined process if it is to produce high reliability software. This process uses a life cycle approach to software reliability that takes into account the risk to reliability due to requirements changes. A requirements change may induce ambiguity and uncertainty in the development process that cause errors in implementing the changes. Subsequently, these errors may propagate through later phases of development and maintenance. In view of the life cycle ramifications of the software reliability process, maintenance is included in this tutorial. Furthermore, because reliability and maintainability determine availability, the latter is also included.
منابع مشابه
RAM analysis of earth pressure balance tunnel boring machines: A case study
Earth pressure balance tunnel boring machines (EPB-TBMs) are favorably applied in urban tunneling projects. Despite their numerous advantages, considerable delays and high maintenance cost are the main disadvantages these machines suffer from. Reliability, availability, and maintainability (RAM) analysis is a practical technique that uses failure and repair dataset obtained over a reasonable ti...
متن کاملAnalysis of reliability and maintainability for multiple repairable units (Case study: Sungun copper mine)
The appropriate operating of mining machines is affected by both the executive and environmental factors. Considering the effects and the related risks lead to a better understanding of the failures of such machines. This leads to a proper prediction of the reliability parameters of such machines. In this research work, the reliability and maintainability analysis of the loading and haulage mac...
متن کاملA Quantitative Evaluation of Maintainability of Software Architecture Styles
Proper decisions play a crucial role in any software architecture design process. An important decision of design stage is the selection of a suitable software architecture style. Lack of investigation on the quantitative impact of architecture styles on software quality attributes is the main problem in using such styles. Consequently, the use of architecture styles in designing is based on th...
متن کاملReliability study of network element design architectures for operating on a remote terminal pole
The proliferation of different network elements to address the explosive demand for the transfer of information in the Internet age has caused hardware designers to develop products at a rapid rate. The increased pressure of time to market has at times led to design choices that have affected network element reliability. Instead of designing for optimum reliability performance, a design is quic...
متن کاملRAM investigation of coal-fired thermal power plants: A case study
Article history: Received 28 October 2011 Accepted December, 5 2011 Available online 8 December 2011 Continuous generation of electricity of a power plant depends on the higher availability of its components/equipments. Higher availability of the components/equipments is inherently associated with their higher reliability and maintainability. This paper investigates the reliability, availabilit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JACIC
دوره 7 شماره
صفحات -
تاریخ انتشار 2010